#####
Replication Data for "Legislative Bellwethers"
Version 1
2 Oct 2018
Contact: goplerud@g.harvard.edu
#####

The R code for the main analysis can be found in bellwethers_replication_LSQ.R

The code used to generate the underlying classification of speeches can be found in bellwethers_classification.py

All figures are saved into the "figures" folder. Everything here can be produced by running the two replication files noted above.

The human-annotated coding used to validate the classification on speeches can be found in "gold_annotations".

In the "data" folder, the objects are as follows:
    "1999-2014_CHES_trend" is the Chapel Hill Expert Survey.
    bills_with_classes(_v2) includes the texts of the bills and their committee jurisdictions.
    classifed_speeches contains the predictions from the python output that can be merged into the metadata.
    dar_no_gogvernment_ids_[0-9] is the speech data
    Formatted Gov Data maps the CMP to committee profiles.
    MPDatatset_MPDS2016b is the Comparative Manifestos Project data
    nomes_clean has miscellaneous information the Portugese MPs.
    stata_replication is a cleaned and processed dataset that can be used to replicate the main results
    svm_codes.RDS gives informative labels to the classification categories.
